Anaphora Resolution of Japanese Zero Pronouns with Deictic Reference
Authors
Abstract
This paper proposes a method to resolve the reference of deictic Japanese zero pronouns which can be implemented in a practical machine translation system. The method uses semantic and pragmatic constraints, such as semantic constraints on cases, modal expressions, verbal semantic attributes and conjunctions, to determine the deictic reference of Japanese zero pronouns. It is highly effective because the volume of knowledge that must be prepared beforehand is not very large and its precision of resolution is good. The method was implemented in the Japanese-to-English machine translation system ALT-J/E. According to a window test for 175 zero pronouns with deictic referents in a sentence set for the evaluation of Japanese-to-English machine translation systems, all of the zero pronouns could be resolved consistently and correctly.

1 Introduction

In all natural languages, elements that can be easily deduced by the reader are frequently omitted from expressions in texts (Kuno, 1978). This phenomenon causes considerable problems in natural language processing systems. For example, a machine translation system needs to recognize that elements which are not present in the source language may become mandatory elements in the target language. In particular, the subject and object are often omitted in Japanese, whereas they are often mandatory in English. Thus, in Japanese-to-English machine translation systems, it is necessary to identify case elements omitted from the original Japanese (these are referred to as "zero pronouns") for their translation into English expressions.
Several methods have been proposed with regard to [...]. When considering the application of these methods to a practical machine translation system for which the translation target area cannot be limited, it is not possible to apply them directly, both because their precision of resolution is low, as they only use limited information, and because the volume of knowledge that must be prepared beforehand is so large. The zero pronouns that must be resolved by a machine translation system can be classified into three types: (a) zero pronouns with antecedents within the same sentence (intrasentential), (b) zero pronouns with antecedents elsewhere in the text (intersentential) and (c) zero pronouns with deictic reference (extrasentential). Regarding type (b), Nakaiwa and Ikehara (1992) proposed a method to determine the intersentential antecedents using verbal semantic attributes. The rules used in this method are independent of the field of the source text. Therefore, anaphora resolution may be conducted with a relatively small volume of knowledge, making the proposed …
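As a rough illustration of how a modal expression can constrain the deictic referent of an omitted subject, the following Python sketch maps a few romanized predicate endings to a likely referent. The rule table, the function name `resolve_deictic_subject` and the romanized examples are assumptions made for illustration only; they are not the rules used in ALT-J/E, and the desiderative rule in particular only approximates the behaviour of declarative sentences.

```python
from typing import Optional

# Hypothetical rule table (an assumption, not the paper's actual rules):
# romanized modal ending -> likely deictic referent of an omitted subject.
MODAL_RULES = [
    ("tai", "I"),           # desiderative: the speaker wants to act
    ("te kudasai", "you"),  # polite request: the hearer is asked to act
    ("nasai", "you"),       # imperative: the hearer is told to act
]


def resolve_deictic_subject(predicate: str) -> Optional[str]:
    """Return a deictic referent for an omitted subject, or None if no rule fires."""
    for ending, referent in MODAL_RULES:
        if predicate.endswith(ending):
            return referent
    # No modal cue: leave the zero pronoun to other constraints.
    return None


if __name__ == "__main__":
    for predicate in ["ikitai", "mite kudasai", "tabenasai", "iku"]:
        print(predicate, "->", resolve_deictic_subject(predicate))
```

A fuller system would presumably combine such modal cues with the other constraint types named in the abstract (semantic constraints on cases, verbal semantic attributes and conjunctions) rather than rely on suffix matching alone.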
Similar resources
Automatic Extraction Of Rules For Anaphora Resolution Of Japanese Zero Pronouns From Aligned Sentence Pairs
This paper proposes a method to extract rules for anaphora resolution of Japanese zero pronouns from aligned sentence pairs. The method focuses on the characteristics of Japanese and English in which both the language families and the distribution of zero pronouns are very different. In this method, zero pronouns in the Japanese sentence and the English translation equivalents of their antecede...
Resolving Discourse Deictic Anaphora in Dialogues
Most existing anaphora resolution algorithms are designed to account only for anaphors with NP-antecedents. This paper describes an algorithm for the resolution of discourse deictic anaphors, which constitute a large percentage of anaphors in spoken dialogues. The success of the resolution is dependent on the classification of all pronouns and demonstratives into individual, discourse deictic a...
A Probabilistic Method for Analyzing Japanese Anaphora Integrating Zero Pronoun Detection and Resolution
This paper proposes a method to analyze Japanese anaphora, in which zero pronouns (omitted obligatory cases) are used to refer to preceding entities (antecedents). Unlike the case of general coreference resolution, zero pronouns have to be detected prior to resolution because they are not expressed in discourse. Our method integrates two probability parameters to perform zero pronoun detection ...
A Fully-Lexicalized Probabilistic Model for Japanese Zero Anaphora Resolution
This paper presents a probabilistic model for Japanese zero anaphora resolution. First, this model recognizes discourse entities and links all mentions to them. Zero pronouns are then detected by case structure analysis based on automatically constructed case frames. Their appropriate antecedents are selected from the entities with high salience scores, based on the case frames and several pref...
Are Pronouns Always Zero in Zero Pronominal Languages? : The Case in Japanese
This paper presents the results of a comparative text analysis of English anaphoric pronouns & their Japanese counterparts. It first demonstrates that although anaphoric pronouns in English are often compared to zero pronominals in Japanese, not all, but only some, of the English anaphoric pronouns have their counterparts in zero pronominals in the same context in Japanese, and then argues that...